SEQUENCE LISTING 


<110> Tomb, Jean-Francois 
Bramucci, Michael G. 
Cheng, Qiong 
Kostichka, Kristy N. 

<120> Rhodococcus Cloning and Expression Vectors 

<130> CL1709 US NA 

<150> 60/254,868 
<151> 2000-12-12 

<160> 30 

s ' s <170> Microsoft Office 97 

O <210> 1 

Ti <211> 1140 

H! <212> DNA 

ru 

y <213> Rhodococcus AN12 


atgaccagcg taagtgctga acacctttcc ggcaaagacc ggcctcccgt cctcgtgtcg 60 

0 tccgataagc gcggcatccg gcacgaactg cgacccaaac ttcaacaaat caccacgtca 120 

gaaacattta acgcctgtgg ccggccgatt tctggcgtga acggtgtgac cattgtcaac 180 

ggtccgaaag gttctggatt cggaggcctt cgttcctgcg gaaagggctg gatctgcccc 24 0 

tgctgtgcgg gaaaagtcgg tgcacatcgt gcagacgaaa tttctcaagt tgttgctcat 300 

caactcggga ctggatctgt tgcgatggtg acgatgacca tgcgccatac agctggtcag 360 

cggctccacg acctatggac tggactttcg gcagcctgga aagctgcgac caacggtcgt 420 

cgttggcgta cggaacgtga aatgtacggc tgcgacggat acgtgcgcgc tgttgaaatc 480 

actcacggaa aaaacggctg gcacgtccac gttcacgcgc tactcatgtt cagtggtgac 540 

gtgagtgaga acatcctcga atccttctcg gatgcgatgt tcgatcggtg gacttccaaa 600 

ctcgtatctc tgggatttgc tgcgccacta cgtaattcgg gtggtctcga tgtacgaaag 660 

atcggcggtg aagctgatca agttctcgct gcgtatctga cgaaaattgc atctggcgtt 720 

ggtatggagg ttggtagtgg cgacggaaaa agtggtcgac atggcaaccg tgcaccctgg 7 80 

gaaatcgctg ttgatgcagt gggcggggat ccacaagcgt tggaactgtg gcgagaattt 840 

1 


gagtttggtt cgatgggacg tcgggcaatc gcgtggtccc gtggattgcg tgcccgagct 900 

ggtcttgggg cagaactaac agatgctcag atcgttgagc aggaagaatc tgccccggtc 960 

atggttgcga tcattccggc gcgatcgtgg atgatgattc ggacttgtgc gccttacgtc 1020 

ttcggcgaga tcctcggact cgtcgaagct ggcgcgactt gggaaaatct tcgtgatcac 1080 

ttgcattatc gattgcccgc agcggatgtg cggcccccga taatatcggt tcgcaagtga 1140 

<210> 2 

<211> 379 

<212> PRT 

<213> Rhodococcus AN12 


Met Thr Ser Val Ser Ala Glu His Leu Ser Gly Lys Asp Arg Pro Pro 
15 10 15 


Val Leu Val Ser Ser Asp Lys Arg Gly He Arg His Glu Leu Arg Pro 
20 25 30 

Lys Leu Gin Gin He Thr Thr Ser Glu Thr Phe Asn Ala Cys Gly Arg 
35 40 45 

Pro He Ser Gly Val Asn Gly Val Thr He Val Asn Gly Pro Lys Gly 
50 " 55 60 

Ser Gly Phe Gly Gly Leu Arg Ser Cys Gly Lys Gly Trp He Cys Pro 
65 70 75 80 

Cys Cys Ala Gly Lys Val Gly Ala His Arg Ala Asp Glu He Ser Gin 


Val Val Ala His Gin Leu Gly Thr Gly Ser Val Ala Met Val Thr Met 
100 105 HO 

Thr Met Arg His Thr Ala Gly Gin Arg Leu His Asp Leu Trp Thr Gly 
115 120 125 

Leu Ser Ala Ala Trp Lys Ala Ala Thr 'Asn Gly Arg Arg Trp Arg Thr 
130 135 140 

Glu Arg Glu Met Tyr Gly Cys Asp Gly Tyr Val Arg Ala Val Glu He 
145 150 155 160 


2 


Thr His Gly Lys Asn Gly Trp His Val His Val His Ala Leu Leu Met 
165 170 175 


Phe Ser Gly Asp Val Ser Glu Asn lie Leu Glu Ser Phe Ser Asp Ala 
180 185 190 

Met Phe Asp Arg Trp Thr Ser Lys Leu Val Ser Leu Gly Phe Ala Ala 
195 200 205 

Pro Leu Arg Asn Ser Gly Gly Leu Asp Val Arg Lys lie Gly Gly Glu 
210 215 220 

Ala Asp Gin Val Leu Ala Ala Tyr Leu Thr Lys lie Ala Ser Gly Val 
225 230 235 240 

Gly Met Glu Val Gly Ser Gly Asp Gly Lys Ser Gly Arg His Gly Asn 
245 250 255 

Arg Ala Pro Trp Glu lie Ala Val Asp Ala Val Gly Gly Asp Pro Gin 
260 265 270 

Ala Leu Glu Leu Trp Arg Glu Phe Glu Phe Gly Ser Met Gly Arg Arg 
275 280 285 

Ala lie Ala Trp Ser Arg Gly Leu Arg Ala Arg Ala Gly Leu Gly Ala 
290 ~ 295 300 

Glu Leu Thr Asp Ala Gin He Val Glu Gin Glu Glu Ser Ala Pro Val 
305 310 315 320 

Met Val Ala He He Pro Ala Arg Ser Trp Met Met He Arg Thr Cys 
325 330 335 

Ala Pro Tyr Val Phe Gly Glu He Leu Gly Leu Val Glu Ala Gly Ala 
340 345 350 

Thr Trp Glu Asn Leu Arg Asp His Leu His Tyr Arg Leu Pro Ala Ala 
355 360 365 


Asp Val Arg Pro Pro He He Ser Val Arg Lys 
370 375 


<210> 3 

<211> 891 

<212> DNA 

<213> Rhodococcus AN12 


3 


<400> 3 

atggatcaaa cagacacgat cccgattgcg attggatgga acgaactagc tcaacctgtc 60 

atggtcgata tagccaaaga tgctgctcac tggctcattc aaggcaaaac ccgttccgga 120 

aaatctcaat gcacctacaa cctgctcgca caggctggat cgaatcccgc tgtgcgtgtc 180 

gtcggagtcg atcccacttc cgtcttacta gccccattcg tccaccgacg accggctgaa 240 

ccgaacatcg agctcgggct gaacgatttt gacaaagtcc tccgagtgct ccagttcgtc 300 

aaagcagaat ctgaccgacg aatcgagtgt ttctgggatc gacgcataga caaaatttcg 360 

ttgttctcgc cagcactacc tctcatcctg ctcgtactgg aagaatttcc cggaatcatc 420 

gagggcgcac aggatttcga tgcaaccaac ggtctgaaac cagcagacag atacgcaccc 480 

cgcatcacat cgcttgttcg acagatcgct gctcagtctg ccaaagcagg catcagaatg 540 

ttgctcttgg ctcaacgtgc ggaagcttcc atcgtgggtg gaaacgcccg ctcgaacttc 600 

gcggtgaaaa tgactctccg cgtagacgaa cctgaatctg tcaaaatgct gcaccccaac 660 

S3 gcaacacctg aagagtgcgc actggtcgaa ggattcgtcc ctggtcaagg cttcttcgac 720 

□ caacccggac tacggcgcca aatgatccga acggttcgcg taggtgagta ctcgacctac 78 0 

:p gcgagttacg tcgaaaacgc agacctcgcg tacgaagccg cactgaacat cgaccgagca 840 

caacgaatga caatcgcctc ggaataccca catctcggcg acataggctg a 8 91 

U <210> 4 

% <211> 296 


<213> Rhodococcus AN12 
<400> 4 

Met Asp Gin Thr Asp Thr lie Pro He Ala He Gly Trp Asn Glu Leu 
15 10 15 

Ala Gin Pro Val Met Val Asp He Ala Lys Asp Ala Ala His Trp Leu 
20 25 30 

He Gin Gly Lys Thr Arg Ser Gly Lys Ser Gin Cys Thr Tyr Asn Leu 
35 40 45 

Leu Ala Gin Ala Gly Ser Asn Pro Ala Val Arg Val Val Gly Val Asp 
50 55 60 

Pro Thr Ser Val Leu Leu Ala Pro Phe Val His Arg Arg Pro Ala Glu 
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Pro Asn lie Glu Leu Gly Leu Asn Asp Phe Asp Lys Val Leu Arg Val 
85 90 95 

Leu Gin Phe Val Lys Ala Glu Ser Asp Arg Arg lie Glu Cys Phe Trp 
100 105 110 

Asp Arg Arg lie Asp Lys lie Ser Leu Phe Ser Pro Ala Leu Pro Leu 
115 " 120 125 

lie Leu Leu Val Leu Glu Glu Phe Pro Gly lie lie Glu Gly Ala Gin 
130 135 140 

Asp Phe Asp Ala Thr Asn Gly Leu Lys Pro Ala Asp Arg Tyr Ala Pro 
145 150 155 160 

Arg lie Thr Ser Leu Val Arg Gin lie Ala Ala Gin Ser Ala Lys Ala 
165 ' 170 175 

03 Gly lie Arg Met Leu Leu Leu Ala Gin Arg Ala Glu Ala Ser lie Val 

Q " 180 185 190 


iu 


Gly Gly Asn Ala Arg Ser Asn Phe Ala Val Lys Met Thr Leu Arg Val 


=5 5 V3_l__y n^ll ^J-U '^y ^v-j- ^ ^_ — . ~- _j _ 

itf 195 200 205 


195 

Asp Glu Pro Glu Ser Val Lys Met Leu His Pro Asn Ala Thr Pro Glu 
210 215 220 


M Glu Cys Ala Leu Val Glu Gly Phe Val Pro Gly Gin Gly Phe Phe Asp 

Q 225 230 235 240 

Gin Pro Gly Leu Arg Arg Gin Met lie Arg Thr Val Arg Val Gly Glu 
245 250 255 

Tyr Ser Thr Tyr Ala Ser Tyr Val Glu Asn Ala Asp Leu Ala Tyr Glu 
260 J 265 270 

Ala Ala Leu Asn lie Asp Arg Ala Gin Arg Met Thr lie Ala Ser Glu 
275 280 285 

Tyr Pro His Leu Gly Asp lie Gly 
290 295 

<210> 5 
<211> 6334 


5 


<212> DNA 

<213> Rhodococcus AN12 


<400> 5 


attcagacca 

acaatcagtc 

caactagcaa 



tt CCftCJ3.cl.5tC 

60 

aagctttgtc 



gcaggtgcgg 

agacggccgc 

tgcacjgcatt 

120 

ggaaccaaat 

tctccactgt 





180 

tgcacccaga 

caaggccttc 

tgctcggtag 




240 

gggttgatga 

cgtgcacgga 

tgctgtcttg 




300 

actcggagca 

atgcgccagt 




c 1 1 cj t c c t cj a. 

360 

accggcttga 

tggggtcgtc 

cataccggct 




420 

gcgacgggga 

gggacgttcc 

gagctgaaca 

ttcatgcgag 

ttcctttgat 


4 80 

acagcttatg 

tctccggtgt 

ccatattcag 

cgacacgcgt 



54 0 

acacatagtg 

tagccagctg 

tccagttttc 

gcacactacg 


a a cat at ttt 

600 

gtggttgatc 

agtcaataag 

ctgtccat at 

ggacgagaaa 

gaggttcgcg 



gcgcaaagaa 

accgaacgaa 

aaatgcaggt 

catcaagcag 



720 

ctcctggcag 

accattcaga 

acgcgcacga 

ct ccacgact 



780 

agccgggctt 

caacgcgaat 

tctggctaca 

agctctcgcg 



840 

aactgcctct 

gagctgcgca 

aatctatttc 




900 

cagccgaacc 

gttgccaccg 

ttgcagatgt 

ttcaccgtcg 



960 

tgagcatgag 

tcatcgtaaa 

aacatcctct 

gacctgctat 

ggccccaatg 

at cacct at t 


accaaggcgg 

cggcttcgcc 

gccgctgcca 

gcaggctccc 

ccacctacgc 



gctcgcgctt 

cggtgctccg 

cccgcaggcc 

caggagcgag 

tttgcgcctc 

cjttt agt cca 


tctaaggggt 

tcctagctgg 

cttgaggtcg 

caacgcatcc 




gaacgcatca 

tctcgatcca 

gcgtggtttc 

ttgaccataa 

atcgagaggt 



gacaacgcca 

tcgacgtcta 

ccgaagctgg 

attcgctgcg 

atgccaagag 

gacgttcgtt 

1320 

gatgctcatg 

tgatgggttt 

acctgcaaaa 

atagtcagca 

gccaaatcgg 

aggcggcggc 


ttcgccgccg 

ctgccagcag 

gctcccccac 

ctacgcgctc 

cgcttcgctc 

gcgcttcggt 

1440 

gctccgcccg 

caggcccagg 

agcgagtttg 

cgcctcgttt 

agtccatcta 

aggggttcct 

1500 

agctggcttg 

aggtcgcaac 

gcatcctgaa 

gtcgatcgag 

gagcaggaac 

gcatcatctc 

1560 

gatccagcgt 

ggtttcttga 

ccataaatcg 

agaggtacac 

gcccatgaca 

acgccatcga 

1620 

cgtctaccga 

agctggattc 

gctgcgatgc 

caagaggacg 

ttcgttgatg 

ctcatgtgat 

1680 

gggtttacct 

gcaaaaatag 

tcagcagcca 

aatcggccgg 

cctttttcta 

tctgcccggt 

1740 




tgaaacaggc 

cgt ctct ctg 

teaaggecaa 

gccgctacgc 

1800 

ggt get at eg 



ccagctt cag 

ageggcaagt 

ategggggga 

1860 

tgccctcaag 

tgtggttcat 



ageaacgett 

ttcacttgcg 

1920 

clcLCCCfclt att 



t gcgggcaat 

egataatgea 

agtgat cacg 

1980 




gagt ccgagg 

atctcgccga 

agaegtaagg 

2040 

aaga !Hltc 



cgccggaat g 

atcgcaacca 

tgacegggge 

2100 




tgttagtt ct 

gccccaagac 

cagctcgggc 

2160 

aCgCaatCCa 



a cgt cccat c 

gaaccaaact 

caaattctcg 

2220 

ccacagttcc 



cact gc at ca 

acagegattt 

cccagggtgc 

2280 

aeggttgeca 


ttttt ccgt c 



caacgccaga 

2340 

tgeaatttte 

gtcagatacg 



tcaccgccga 

tetttegtae 

2400 

atcgagacca 

cccgaatt ac 




gtttggaagt 

2460 

ccaccgatcg 




ttctcactca 

cgt c a cc act 

2520 

gaacatgagt 



ccagccgtt t 

tttccgtgag 

tgatttcaac 

2580 

agcgcgcacg 

t at ccgt cgc 



gtacgccaac 

gaegacegtt 

2640 

ggtegcaget 

tt ccaggct g 

ccgaaagfccc 




2700 

tgtatggcgc 

at ggt cat eg 




gatgagcaac 

2760 

aacttgagaa 

att t cgt ct g 





2820 

ccagcccttt 





cgttgacaat 

2880 

ggtcacaccg 

tt cacgccag 



ttaaatgttt 

ctgacgtggt 

2940 

gatttgttga 

agtttgggt c 




acgacacgag 

3000 

gaegggagge 

eggt etttge 

eggaaaggt g 




3060 

ggtcctagtc 

aagt aggagc 




cct tegt cgc 

3120 

egctegtatt 

t teggagtaa 

atccagctag 



tcgaggttca 

3180 

tcgattattt 

ggtttttatc 

cact t aacca 




3240 

gtcagtgcgg 

cgtgt egggg 

gagt cget ag 



ttcgagaaca 

3300 

ccgaccttct 

cttctggggt 

tetgett gt c 



accteggt at 

3360 

tcacccaagt 

gttcaat ate 

gaatcegget 


aatcaggggt 

gtagtagcag 

3420 

cacatcgcag 

ccagaatctc 

ggacgattcg 

gcgcgttcac 

cagcatgaat 

ccaaccataa 


acgtcatgcc 

caccccatag 

atcaggccct 

egatgategt 

aaatgecaac 

ggctagtegg 

3540 

aggatgaata 

ccgtagcttc 

gtgcttcacg 

catcaaccct 

ctgatctget 

gcactcagaa 

3600 

ttgeatgace 

tcccgaatga 

ctgeataact 

egtegtagae 

ctgagcaacg 

aacgaaggee 

3660 
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gatcagcatt gtccatgaag agttggacga acttcggccg gacgaggcca atccacggcg 3720 

cagtcaaagt ttcaaaatca tgtgcctcga ggtgctcatg cattgcaacc gcccatgcgg 3780 

cccctcgagc ggcgcaccag tctcgttcaa ctccctcgct gtccgaaatg tcgtatttaa 3840 

ggcccagtga tcgtccaact tcggcagctg cgtcactggc acgtttccaa tcgtcaccgc 3900 

gtaagtcgtt gagctttccg agttcatcgc ctagaagcag ctcagacatt gcaaaaacgg 3960 

tcatcgaact gacccatcgt ggaccgacta gtgcaccaag gtcgtcgtcg gtgatctgca 4020 

tgccgcgaag ttcgtcgacg acagcttggc cttccaaacc tactctggcc ctgagtattt 4080 

cagttattac gagatgatcg ttcggccagc ctgatttgat ccggagtgca gtcgttacga 4140 

ctcgttccgt gggcaggttt cggcgtgagg cgagtttttc tcctgcctca tgtgcaacct 4200 

tctcaaattg ctgtcgaatg taggtgttta ccgggattgc gtctgtcggg tagccgatca 4260 

aggtgtgtcc tcctgtgtgt tcggttgtca gcctatgtcg ccgagatgtg ggtattccga 4320 

ggcgattgtc attcgttgtg ctcggtcgat gttcagtgcg gcttcgtacg cgaggtctgc 4380 

y> 

O gttttcgacg taactcgcgt aggtcgagta ctcacctacg cgaaccgttc ggatcatttg 4440 

E3 

r'j gcgccgtagt ccgggttggt cgaagaagcc ttgaccaggg acgaatcctt cgaccagtgc 4500 

r 2 gcactcttca ggtgttgcgt tggggtgcag cattttgaca gattcaggtt cgtctacgcg 4 5 60 

W gagagtcatt ttcaccgcga agttcgagcg ggcgtttcca cccacgatgg aagcttccgc 4 620 

= acgttgagcc aagagcaaca ttctgatgcc tgctttggca gactgagcag cgatctgtcg 4 68 0 

Iji aacaagcgat gtgatgcggg gtgcgtatct gtctgctggt ttcagaccgt tggttgcatc 4740 

y gaaatcctgt gcgccctcga tgattccggg aaattcttcc agtacgagca ggatgagagg 4800 

Q tagtgctggc gagaacaacg aaattttgtc tatgcgtcga tcccagaaac actcgattcg 48 60 

tcggtcagat tctgctttga cgaactggag cactcggagg actttgtcaa aatcgttcag 4 920 

cccgagctcg atgttcggtt cagccggtcg tcggtggacg aatggggcta gtaagacgga 4 98 0 

agtgggatcg actccgacga cacgcacagc gggattcgat ccagcctgtg cgagcaggtt 504 0 

gtaggtgcat tgagattttc cggaacgggt tttgccttga atgagccagt gagcagcatc 5100 

tttggctata tcgaccatga caggttgagc tagttcgttc catccaatcg caatcgggat 5160 

cgtgtctgtt tgatccatca ggcgtccgtg cttttgtcga acggaagatc cttttcttgc 5220 

tcccaccagg gccgattgtc cccgagtatg ccgccggcct cttccttcaa tgtgccggcc 5280 

gatgagtcct cgacgtcact gagccatgct gcatctcgtg cttgagaaat ggtgtctgca 5340 

tcgatcagaa gtagctcgac ccgacgcggc tctactttgg tgaaactggc acgtagagca 5400 

ccgaaagcat cggctatttt gaccgtcttc gatgtcatat cttcaccggt gatccctgtc 54 60 

ggaaggtcga aagcgactga tcgagtcaat ccgtcgtccg aaaatttgta gctacgaatg 5520 

atgggaggct gcccagagga gttgatcaga ccaagattgg ccgcagcacc tgcaacttcc 5580 
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si 








5640 

ct cacjcjt aga. 






5700 

CJCCjt CCj3.ci.CIB. 

t gtacagcag 





57 60 

cjcaatttttg 




tggaacgt ct 

tttaacctgg 

5820 

tcaccgcgtc 



atcggggcca 

ctccacaacg 

acattcggac 

5880 

tatctacttc 

gacttgctca 





5940 

atcccgaggt 

tccattctga 

agattgcttt 

gcactcgatc 




ccgtctcagc 

ctctacgccc 

ttatgtaccg 

ggacaggggt 

attcacggtc 



cccgccagcc 

ctcaggcact 

ggcacgtcac 

cgcacgcgct 

ggtcttcgag 



tgatgacctt 

tccatctggg 

ttagtccact 

ggatcccatc 

ggcgctcaat 

tccggattca 


ctcggatgta 

tccaggtatc 

tctctgcatg 

cactgacaga 

tggaacagaa 

cctgtcggaa 

6240 

gaggggatct 

gcaccaggtc 

accgttcgtt 

cagcccatga 

gtcccgacgc 

tcttgcattc 

6300 

cgctggaaag 

cttaatatct 

tgcgtgccaa 

caat 



6334 


<210> 6 
<211> 11241 
<212> DNA 

<213> Plasmid pRHBR17 
<400> 6 

ttctcatgtt tgacagctta tcatcgataa gctttaatgc ggtagtttat cacagttaaa 60 

ttgctaacgc agtcaggcac cgtgtatgaa atctaacaat gcgctcatcg tcatcctcgg 120 

caccgtcacc ctggatgctg taggcatagg cttggttatg ccggtactgc cgggcctctt 180 

gcgggatatc gtccattccg acagcatcgc cagtcactat ggcgtgctgc tagcgctata 240 

tgcgttgatg caatttctat gcgcacccgt tctcggagca ctgtccgacc gctttggccg 300 

ccgcccagtc ctgctcgctt cgctacttgg agccactatc gactacgcga tcatggcgac 360 

cacacccgtc ctgtggatcc tctacgccgg acgcatcgtg gccggcatca ccggcgccac 420 

aggtgcggtt gctggcgcct atatcgccga catcaccgat ggggaagatc gggctcgcca 480 

cttcgggctc atgagcgctt gtttcggcgt gggtatggtg gcaggccccg tggccggggg 540 

actgttgggc gccatctcct tgcatgcacc attccttgcg gcggcggtgc tcaacggcct 600 

caacctacta ctgggctgct tcctaatgca ggagtcgcat aagggagagc gtcgaccgat 660 

gcccttgaga gccttcaacc cagtcagctc cttccggtgg gcgcggggca tgactatcgt 720 

cgccgcactt atgactgtct tctttatcat gcaactcgta ggacaggtgc cggcagcgct 780 

9 


ctgggtcatt 

ttcggcgagg 

accgctttcg 

ct g gag eg eg 


gcctgtcgct 


tgcggtattc 

ggaatcttgc 

acgccctcgc 

tcaagccttc 




acgtttcggc 

gagaagcagg 

ccattatcgc 





cttgctggcg 

ttcgcgacgc 






cggcggcatc 

gggatgcccg 

cgttgcaggc 

catgctgtcc 




tcagggacag 

ctt caaggat 

cgctcgcggc 




114 0 

gctgatcgtc 

acggcgattt 

atgccgcctc 





tgtaggcgcc 

gccctatacc 

ttgtctgcct 

ccccgcgttg 

c 9" t c g c g CJ" t g 



ggccacctcg 

acctgaatgg 

aagceggegg 

cacctcgcta 




attggagcca 

atcaattctt 

geggagaact 




1380 

atatccatcg 

cgtccgccat 

ctccagcagc 

cgcacgcggc 




ggcgtttttc 

cataggctcc 

gcccccctga 

cgagcatcac 

3. a. 3. 3. 3. t CCJclC 



gaggtggcga 

aacccgacag 

gactataaag 





cgtgcgctct 

cctgttccga 

ccctgccgct 

taceggatae 

ctgtccgcct 



gggaagcgtg 

gcgctttctc 

atagctcacg 

ctgtaggtat 

etcagttegg 

tgtaggtcgt 


tcgctccaag 

ctgggctgtg 

tgcacgaacc 

ccccgttcag 

cccgaccgct 

gcgccttatc 


cggtaactat 

cgtcttgagt 

ccaacccggt 

aagacacgac 

ttatcgccac 

tggcagcagc 


cactggtaac 

aggatt age a 

gagegaggta 

tgtaggcggt 




gtggcctaac 

tacggctaca 

ctagaaggac 

agtatttggt 

atctgcgctc 

tgetgaagee 


agttaccttc 

ggaaaaagag 

ttggtagctc 

ttgatcegge 

aaacaaacca 

ccgctggtag 


cggtggtttt 

tttgtttgca 

agcagcagat 

tacgegcaga 

aaaaaaggat 

ctcaagaaga 


tcctttgatc 

ttttctaegg 

ggtctgaege 

tcagtggaac 

gaaaactcac 

gttaagggat 


tttggtcatg 

agattatcaa 

aaaggat ctt 

cacctagatc 

cttttaaatt 



ttttaaatca 

atctaaagta 

tatatgagta 

aacttggtct 

gacagttacc 



cagtgaggca 

cctatctcag 

cgatctgtct 

attt cgttca 




cgtcgtgtag 

ataactacga 

tacgggaggg 

cttaccatct 

ggccccagtg 

ctgeaatgat 


accgcgagac 

ccacgctcac 

cggctccaga 

tttatcagca 

ataaaccagc 

cagceggaag 


ggccgagcgc 

agaagtggtc 

ctgcaacttt 

atccgcctcc 

atccagtcta 

ttaattgttg 


ccgggaagct 

agagtaagta 

gttcgccagt 

taatagtttg 

cgcaacgttg 

ttgccattgc 

2520 

tgcaggcatc 

gtggtgtcac 

getegtegtt 

tggtatggct 

tcattcagct 

ccggttccca 

2580 

acgatcaagg 

cgagttacat 

gatcccccat 

gttgtgcaaa 

aaagcggtta 

gctccttcgg 

2640 

tcctccgatc 

gttgtcagaa 

gtaagttggc 

cgcagtgtta 

10 

tcactcatgg 

ttatggcagc 

2700 





cgtaagatgc 

ttttctgtga 

ctggtgagta 

2760 

ctcaaccaag 


aatagtgtat 

gcggcgaccg 

agttgctctt 

gcccggcgtc 

2820 

aacacgggat 

aataccgcgc 


aactttaaaa 

gtgctcatca 

ttggaaaacg 

2880 

ttcttcgggg 

cgaaaactct 


accgctgttg 

agatccagtt 

cgatgtaacc 

2940 

cactcgtgca 



ttttactttc 

accagcgttt 

ctgggtgagc 

3000 

aaaaacagga 



gggaataagg 

gcgacacgga 

aatgttgaat 

3060 


ttcctttttc 

aatattattg 

aagcatttat 

cagggttatt _ 

gtctcatgag 

3120 

cggatacata 

tttgaatgta 



ggggttCCgC 

gcacatttcc 

3180 

ccgaaaagt g 


tctaagaaac 

cattattatc 

atgacattaa 

cctataaaaa 

3240 

taggcgtatc 

acgaggccct 

ttcgtcttcg 

aataaatacc 

tgtgacggaa 

gatcacttcg 

3300 

cagaataaat 

aaatcctggt 

gtccctgttg 

ataccgggaa 

gccctgggcc 

aacttttggc 

3360 

gaaaatgaga 

cgttgatcgg 

cacgtaagag gttccaactt 

tcaccataat 

gaaataagat 

3420 

cactaccggg 

cgtatttttt 

gagttatcga 

gattttcagg 

agctaaggaa 

gctaaaatgg 

3480 

agaaaaaaat 


accaccgttg 

atatatccca 

atggcatcgt 

aaagaacatt 

3540 

ttgaggcatt 

tcagtcagtt 

gctcaatgta 

cctat aacca 

gaccgttcag 

attcagacca 

3600 

acaatcagtc 

caactagcaa 

ggcgacaacc 

ggtatcgcaa 

ttcgtgaaac 

aagctttgtc 

3660 

atgcgtccgc 

gctcttacga 

gcaggtgcgg 

agacggccgc 

tgcaggcatt 

ggaaccaaat 

3720 

tctccactgt 

gatggatagt 

gcgagacgat 

ccatgccagt 

catgtagggc 

tgcacccaga 

3780 

caaggccttc 

tgctcggtag 

atcgtgccga 

agctgaacgg 

ctcgtt cggc 

gggttgatga 

3840 

cgtgcacgga 

tgctgtcttg tcagtcgcaa 

cagttccgtc 

cttgcgtgca 

actcggagca 

3900 

atgcgccagt 

cgaatacttc 

acacggccgt 

cgggagtgag 

cttgtcctga 

accggcttga 

3960 

tggggtcgtc 

cataccggct 

acgaacaccg 

ggaactgatc 

agcggtagtt 

gcgacgggga 

4020 

gggacgttcc 

gagctgaaca 

ttcatgcgag 

ttcctttgat 

cgaggctggt 

acagcttatg 

4080 

tctccggtgt 

ccatattcag 

cgacacgcgt 

tcatctacac 

tcaaaaccgt 

acacatagtg 

4140 

tagccagctg 

tccagttttc 

gcacactacg 

ttagcaactg 

aacatatttt 

gtggttgatc 

4200 

agtcaataag 

ctgtccatat 

ggacgagaaa 

gaggttcgcg 

cgatgattca 

gcgcaaagaa 

4260 

accgaacgaa 

aaatgcaggt 

catcaagcag 

gcgtccgtgg 

atctgtcaca 

ctcctggcag 

4320 

accattcaga 

acgcgcacga 

ctccacgact 

gtcgcaatgg 

agctacgaga 

agccgggctt 

4380 

caacgcgaat 

tctggctaca 

agctctcgcg 

gacatcacat 

ctgttgtggc 


4440 

gagctgcgca 

aatctatttc 

ccgttttctc 

gttgacgagc 

ttgacgtcag 

cagccgaacc 

4500 

gttgccaccg 

ttgcagatgt 

ttcaccgtcg 

accatcagta 

cttggcgtgg 

tgagcatgag 

4560 

tcatcgtaaa 

aacatcctct 

gacctgctat 

ggccccaatg 

atcacctatt 

accaaggcgg 

4620 
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cggcttcgcc 
cggtgctccg 
tcctagctgg 
tctcgatcca 
tcgacgtcta 
tgatgggttt 
ctgccagcag 
caggcccagg 
aggtcgcaac 
ggtttcttga 
agctggattc 
gcaaaaatag 
agaccaacca 
cagccctgac 
tgtggttcat 
atcgggggcc 
caagtcgcgc 
cgaatcatca 
tgctcaacga 
cgggaccacg 
aacgcttgtg 
tgtcgaccac 
gtcagatacg 
cccgaattac 
aacatcgcat 
agcgcgtgaa 
tatccgtcgc 
ttccaggctg 
atggtcatcg 
atttcgtctg 
ccgcaggaac 
ttcacgccag 


gccgctgcca gcaggctccc 
cccgcaggcc caggagcgag 
cttgaggtcg caacgcatcc 
gcgtggtttc ttgaccataa 
ccgaagctgg attcgctgcg 
acctgcaaaa atagtcagca 
gctcccccac ctacgcgctc 
agcgagtttg cgcctcgttt 
gcatcctgaa gtcgatcgag 
ccataaatcg agaggtacac 
gctgcgatgc caagaggacg 
tcagcagcca aatcggccgg 
tgaaacaggc cgtctctctg 
agagagacac ccagcttcag 
gcgggtgaaa gttgttgctc 
gcacatccgc tgcgggcaat 
cagcttcgac gagtccgagg 
tccacgatcg cgccggaatg 
tctgagcatc tgttagttct 
cgattgcccg acgtcccatc 
gatccccgcc cactgcatca 
tttttccgtc gccactacca 
cagcgagaac ttgatcagct 
gtagtggcgc agcaaatccc 
ccgagaagga ttcgaggatg 
cgtggacgtg ccagccgttt 
agccgtacat ttcacgttcc 
ccgaaagtcc agtccatagg 
tcaccatcgc aacagatcca 
cacgatgtgc accgactttt 
gaaggcctcc gaatccagaa 
aaatcggccg gccacaggcg 


ccacctacgc gctccgcttc 
tttgcgcctc gtttagtcca 
tgaagtcgat cgaggagcag 
atcgagaggt acacgcccat 
atgccaagag gacgttcgtt 
gccaaatcgg aggcggcggc 
cgcttcgctc gcgcttcggt 
agtccatcta aggggttcct 
gagcaggaac gcatcatctc 
gcccatgaca acgccatcga 
ttcgttgatg ctcatgtgat 
cctttttcta tctgcccggt 
tcaaggccaa gccgctacgc 
agcggcaagt atcgggggga 
agcaacgctt ttcacttgcg 
cgataatgca agtgatcacg 
atctcgccga agacgtaagg 
atcgcaacca tgaccggggc 
gccccaagac cagctcgggc 
gaaccaaact caaattctcg 
acagcgattt cccagggtgc 
acctccatac caacgccaga 
tcaccgccga tctttcgtac 
agagatacga gtttggaagt 
ttctcactca cgtcaccact 
tttccgtgag tgatttcaac 
gtacgccaac gacgaccgtt 
tcgtggagcc gctgaccagc 
gtcccgagtt gatgagcaac 
cccgcacagc aggggcagat 
cctttcggac cgttgacaat 
ttaaatgttt ctgacgtggt 

12 




t ct aa.cfcj'Cf'crt 


gaacgcatca 


gacaacgcca 


gat get cat g 


ttcgccgccg 


gctccgcccg 




gatccagegt 


cgtctaccga 


gggtttacct 


cagccccccg 

5340 

ggtgctatcg 


tgccctcaag 


aaccgatatt 


aagattttcc 

5580 

cgcacaagtc 

5640 

agattcttcc 

5700 

acgcaatcca 

5760 

ccacagttcc 


aeggttgeca 


tgeaatttte 


atcgagacca 


ccaccgatcg 

6060 

gaacatgagt 


agcgcgcacg 


ggtegcaget 


tgtatggcgc 

6300 

aacttgagaa 

6360 

ccagcccttt 

6420 

ggtcacaccg 

6480 

gatttgttga 

6540 


agtttgggt c 

gcagttcgtg 

ccggatgccg 

cgcttatcgg 

acgacacgag 

gaegggagge 


cggtctttgc 

cggaaaggtg 

ttcagcactt 

acgctggtca 

taacgagegg 

ggtcctagtc 


aagtaggagc 

ctcgaaggcg 

gcggcagggt 

ggtccaacac 

ccttcgtcgc 

egctegtatt 









ggtfctttatc 



cgtttatcgc 

tgatctgctg 

gtcagtgcgg 


cgtgtcgggg 




ttcgagaaca 

ccgaccttct 







tcacccaagt 







cacatcgcag 




gcgcgttcac 

cagcatgaat 

ccaaccataa 

acgtcatgcc 




cgatgatcgt 

aaatgccaac 

ggctagtegg 

aggatgaata 





ctgatctgct 

gcactcagaa 

ttgeatgace 




cgtcgtagac 

ctgagcaacg 

aacgaaggee 

gatcagcatt 


gt^ccatgaag 

agttggacga 

acttcggccg 

gacgaggcca 

atccacggcg 

cagt caaagt 


ttcaaaatca 

tgtgcctcga 

ggtgctcatg 

cattgcaacc 

gcccatgcgg 

cccctcgagc 


ggcgcaccag 

tctcgttcaa 

ctccctcgct 

gtccgaaatg 

tegtatttaa 

ggcccagtga 


tcgtccaact 

tcggcagctg 

cgtcactggc 

acgtttccaa 

tcgtcaccgc 

gtaagtcgtt 

7500 

gagctttccg 

agttcatcgc 

ctagaagcag 

ctcagacatt 

geaaaaaegg 

tcatcgaact 


gacccatcgt 

ggaccgacta 

gtgcaccaag 

gtcgtcgtcg 

gtgatctgea 

tgccgcgaag 


ttcgtcgacg 

acagcttggc 

cttccaaacc 

tactctggcc 

ctgagtattt 

cagttattac 

7 68 0 

gagatgatcg 

tt cggccagc 

ctgatttgat 

ccggagtgca 

gtcgttacga 

ctcgttccgt 

7740 

gggcaggttt 

cggcgtgagg 

cgagtttttc 

tcctgcctca 

tgtgcaacct 

tctcaaattg 


ctgtcgaatg 

taggtgttta 

ccgggattgc 

gtctgtcggg 

tagecgatea 

aggtgtgtcc 


tcctgtgtgt 

tcggttgtca 

gcctatgtcg 

ccgagatgtg 

ggtattccga 

ggcgattgtc 


attcgttgtg 

ctcggtcgat 

gttcagtgcg 

get t eg t a eg 

egaggtctge 

gttttcgacg 


taactcgcgt 

aggtcgagta 

ctcacctacg 

cgaaccgttc 

ggatcatttg 

gcgccgtagt 


ccgggttggt 

cgaagaagcc 

ttgaccaggg 

acgaatcctt 

cgaccagtgc 

gcactcttca 


ggtgttgcgt 

tggggtgcag 

cattttgaca 

gattcaggtt 

cgtctacgcg 

gagagtcatt 

8160 

ttcaccgcga 

agttcgagcg 

ggcgtttcca 

cccacgatgg 

aagcttccgc 

aegttgagee 


aagagcaaca 

ttctgatgcc 

tgctttggca 

gactgagcag 

cgatctgtcg 

aacaagegat 

8280 

gtgatgcggg 

gtgcgtatct 

gtctgctggt 

ttcagaccgt 

tggttgcatc 

gaaatcctgt 

8340 

gcgccctcga 

tgattccggg 

aaattcttcc 

agtacgagca 

ggatgagagg 

tagtgctggc 

8400 

gagaacaacg 

aaattttgtc 

tatgcgtcga 

tcccagaaac 

13 

actcgattcg 

teggtcagat 

8460 


i. 



cgaactggag 

cact cggagg 

actttgtcaa 

aatcgttcag 

cccgagctcg 

8520 


cagccggtcg 

tcggtggacg 

aatggggcta 

gtaagacgga 

agtgggatcg 

8580 

actccgacga 

cacgcacagc 

gggattcgat 

ccagcctgtg 

cgagcaggtt 

gtaggtgcat 

8640 

tga.ga.ttttc 

cggaacgggt 

tttgccttga 

atgagccagt 

gagcagcatc 

tttggctata 

8700 



tagttcgttc 

catccaatcg 

caategggat 

cgtgtctgtt 

8760 




aeggaagate 

ettttcttge 

tcccaccagg 

8820 


cccgagtatg 

ccgccggcct 

cttccttcaa 

tgtgccggcc 

gatgagtcct 

8 880 

cga.cgt ca.ct 

gagccatgct 

gcatctcgtg 

cttgagaaat 

ggtgtctgca 

tcgatcagaa 

8940 

gtagct cgac 

ccgacgcggc 

tctactttgg 

tgaaactggc 

aegtagagea 

ccgaaagcat 

9000 

cggct a.ttt t 

gaccgt ctt c 

gat gt cat at 

cttcaccggt 

gatccctgtc 

ggaaggtcga 

9060 




aaaatttgta 

gctacgaatg 

atgggaggct 




ccaagattgg 

ccgcagcacc 

t gca act tec 

ggggttcctc 

9180 

gccaccatcg 

agctgtacga 

cgtttgcgac 

gccgagcctt 

cgttgcctct 

ctcaggtaga 

924 0 

ccattgccac 

aacgcacacc 

agcagcacac 

tgaccaaaag 

ccacatctga 

gegtcgaaga 

9300 

tgtacagcag 

cagaagcaac 

agaaacgtag 

aggacagaat 

egggtaateg 

gcaatttttg 

9360 

ccttgagttt 

tgctcgcaaa 

atttgccagg 

tggaaegtet 

tttaacctgg 

tcaccgcgtc 

9420 

gaacggcttc 

gtagttgctc 

atcggggcca 

ctccacaacg 

acatteggae 

tatctacttc 

9480 

gacttgctca 

tctacgttcc 

acaaccacga 

ttcgactgga 

aegagagege 

atcccgaggt 

954 0 

tccattctga 

agattgcttt 

gca etc gate 

actcatcaaa 

gtctctggaa 

ccgtctcagc 

9600 

ctctacgccc 

ttatgtaccg 

ggacaggggt 

attcaeggtc 

aaatacactg 

cccgccagcc 

9660 

ctcaggcact 

ggcacgtcac 

cgcacgcgct 

ggtcttcgag 

tacggcgacg 

tgatgacctt 

9720 

tccatctggg 

ttagtccact 

ggatcccatc 

ggcgctcaat 

teeggattea 

ctcggatgta 

9780 

tccaggtatc 

tctctgcatg 

cactgacaga 

tggaacagaa 

cctgtcggaa 

gaggggatct 

9840 

gcaccaggtc 

accgttcgtt 

cagcccatga 

gtcccgacgc 

tettgeatte 

cgctggaaag 

9900 

cttaatatct 

tgcgtgccaa 

caatctggat 

attaeggect 

ttttaaagac 

cgtaaagaaa 

9960 

aataagcaca 

agttttatcc 

ggectttatt 

cacattcttg 

cccgcctgat 

gaatgetcat 

10020 

ccggaattcc 

gtatggcaat 

gaaagacggt 

gagctggtga 

tatgggatag 

tgttcaccct 

10080 

tgttacaccg 

ttttccatga 

gcaaactgaa 

aegttttcat 

cgctctggag 

tgaataccac 

10140 

gacgatttcc 

ggcagtttct 

acacatatat 

tegcaagatg 

tggcgtgtta 

eggtgaaaac 

10200 

ctggcctatt 

tccctaaagg 

gtttattgag 

aatatgtttt 

tcgtctcagc 

caatccctgg 

10260 

gtgagtttca 

ccagttttga 

tttaaacgtg 

gecaatatgg 

acaacttctt 

cgcccccgtt 

10320 

ttcaccatgg 

gcaaatatta 

tacgeaagge 

gacaaggtgc 

tgatgecget 

ggcgattcag 

10380 
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gttcatcatg 

ccgtttgtga 

tggcttccat 

gtcggcagaa 

tgcttaatga 

attacaacag 


tactgcgatg 

agtggcaggg 

cggggcgtaa 

tttttttaag 

gcagttattg 

gtgcccttaa 


acgcctggtg 

ctacgcctga 

ataagtgata 

ataagcggat 

gaatggcaga 

aattcgaaag 


caaattcgac 

ccggtcgtcg 

gttcagggca 

gggtcgttaa 

atagccgctt 

atgtctattg 


ctggtttacc 

ggtttattga 

ctaccggaag 

cagtgtgacc 

gtgtgcttct 

caaatgcctg 


aggccagttt 

gctcaggctc 

tccccgtgga 

ggtaataatt 

gacgatatga 

tcatttattc 

1074 0 

tgcctcccag 

agcctgataa 

aaacggtgaa 

tccgttagcg 

aggtgccgcc 

ggcttccatt 


caggtcgagg 

tggcccggct 

ccatgcaccg 

cgacgcaacg 

cggggaggca 

gacaaggtat 


agggcggcgc 

ctacaatcca 

tgccaacccg 

ttccatgtgc 

t cgccgaggc 

ggcataaatc 


gccgtgacga 

tcagcggtcc 

agtgatcgaa 

gttaggctgg 

taagagccgc 

gagcgatcct 


tgaagctgtc 

cctgatggtc 

gtcatctacc tgcctggaca 

gcatggcctg 

caacgcgggc 

11040 

atcccgatgc 

cgccggaagc 

gagaagaatc 

ataatgggga 

aggccatcca 

gcctcgcgtc 

11100 

gcgaacgcca 

gcaagacgta 

gcccagcgcg 

tcggccgcca 

tgccggcgat 

aatggcctgc 

11160 

ttctcgccga 

aacgtttggt 

ggcgggacca 

gtgacgaagg 

cttgagcgag 

ggcgtgcaag 

11220 

attccgaata 

ccgcaagcga 

c 




11241 


<210> 7 
<211> 9652 
<212> DNA 

<213> Plasmid pRHBR17 
<400> 7 

ttctcatgtt tgacagctta tcatcgataa gctttaatgc ggtagtttat cacagttaaa 60 

ttgctaacgc agtcaggcac cgtgtatgaa atctaacaat gcgctcatcg tcatcctcgg 120 

caccgtcacc ctggatgctg taggcatagg cttggttatg ccggtactgc cgggcctctt 180 

gcgggatatc gtccattccg acagcatcgc cagtcactat ggcgtgctgc tagcgctata 24 0 

tgcgttgatg caatttctat gcgcacccgt tctcggagca ctgtccgacc gctttggccg 300 

ccgcccagtc ctgctcgctt cgctacttgg agccactatc gactacgcga tcatggcgac 360 

cacacccgtc ctgtggatcc tctacgccgg acgcatcgtg gccggcatca ccggcgccac 420 

aggtgcggtt gctggcgcct atatcgccga catcaccgat ggggaagatc gggctcgcca 480 

cttcgggctc atgagcgctt gtttcggcgt gggtatggtg gcaggccccg tggccggggg 54 0 

actgttgggc gccatctcct tgcatgcacc attccttgcg gcggcggtgc tcaacggcct 600 

caacctacta ctgggctgct tcctaatgca ggagtcgcat aagggagagc gtcgaccgat 660 

15 


gcccttgaga gccttcaacc cagtcagctc cttccggtgg gcgcggggca tgactatcgt 720 

cgccgcactt atgactgtct tctttatcat gcaactcgta ggacaggtgc cggcagcgct 780 

ctgggtcatt ttcggcgagg accgctttcg ctggagcgcg acgatgatcg gcctgtcgct 84 0 

tgcggtattc ggaatcttgc acgccctcgc tcaagccttc gtcactggtc ccgccaccaa 900 

acgtttcggc gagaagcagg ccattatcgc cggcatggcg gccgacgcgc tgggctacgt 960 

cttgctggcg ttcgcgacgc gaggctggat ggccttcccc attatgattc ttctcgcttc 1020 

cggcggcatc gggatgcccg cgttgcaggc catgctgtcc aggcaggtag atgacgacca 1080 

tcagggacag cttcaaggat cgctcgcggc tcttaccagc ctaacttcga tcactggacc 1140 

gctgatcgtc acggcgattt atgccgcctc ggcgagcaca tggaacgggt tggcatggat 1200 

tgtaggcgcc gccctatacc ttgtctgcct ccccgcgttg cgtcgcggtg catggagccg 1260 

ggccacctcg acctgaatgg aagccggcgg cacctcgcta acggattcac cactccaaga 1320 

attggagcca atcaattctt gcggagaact gtgaatgcgc aaaccaaccc ttggcagaac 1380 

P atatccatcg cgtccgccat ctccagcagc cgcacgcggc gcatctcggg ccgcgttgct 1440 

Q ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca 1500 

"i. gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct 1560 

y \ 

W cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc 1620 

% gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt 1680 

U 

tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc 1740 

y cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc 1800 

Q cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg 18 60 

gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc 1920 

agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag 1980 

cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga 2040 

tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat 2100 

tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag 2160 

ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat 2220 

cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc 2280 

cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat 2340 

accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag 24 00 

ggccgagcgc agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg 24 60 

ccgggaagct agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc 2520 

ttcatctaca ctcaaaaccg tacacatagt gtagccagct gtccagtttt cgcacactac 2580 
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gttagcaact 

gaacatattt 

tgtggttgat 

cagtcaataa 


tggacgagaa 


agaggttcgc 

gcgatgattc 

agcgcaaaga 

aaccgaacga 

aaaatgcagg 

tcatcaagca 


ggcgtccgtg 

gatctgtcac 

actcctggca 

gaccattcag 

aacgcgcacg 

actccacgac 


tgtcgcaatg 

gagctacgag 

aagccgggct 

tcaacgcgaa 

ttctggctac 

aagctctcgc 


ggacatcaca 

tctgttgtgg 

gaactgcctc 

tgagctgcgc 

aaatctattt 

cccgttttct 









gaccatcagt 

acttggcgtg 

gtgagcatga 

gtcatcgtaa 

aaacatcctc 

tgacctgeta 


tggccccaat 

gatcacctat 

taccaaggcg 

gcggcttcgc 

cgccgctgcc 

agcaggctcc 


cccacctacg 

cgctccgctt 

cgctcgcgct 

tcggtgctcc 

gcccgcaggc 

ecaggagega 


gtttgcgcct 

cgtttagtcc 

atctaagggg 

ttcctagctg 

gcttgaggtc 

gcaacgcatc 


ctgaagtcga 

t cgaggagca 

ggaacgcatc 

atctcgatcc 

agcgtggttt 

cttgaccata 


aatcgagagg 

tacacgccca 

tgacaacgcc 

atcgacgtct 

accgaagctg 

gattegctge 


gatgccaaga 

ggacgtt cgt 

tgatgctcat 

gtgatgggt t 

tacctgcaaa 

aatagtcagc 


agccaaatcg 

gaggcggcgg 

cttcgccgcc 

gctgccagca 

ggctccccca 

cctacgcgct 


ccgcttcgct 

cgcgcttcgg 

tgctccgccc 

gcaggcccag 

gagcgagttt 

gcgcctcgtt 


tagtccatct 

aaggggttcc 

tagctggctt 

gaggtcgcaa 

cgcatcctga 

agtcgatcga 


ggagcaggaa 

cgcatcatct 

cgatccagcg 

tggtttcttg 

accataaatc 

gagaggtaca 


cgcccatgac 

aacgccatcg 

acgtctaccg 

aagctggatt 

cgctgcgatg 

ccaagaggac 


gttcgttgat 

gctcatgtga 

tgggtttacc 

tgcaaaaata 

gtcagcagcc 

aaatcggccg 


gcctttttct 

atctgcccgg 

tcagcccccc 

gagaccaacc 

atgaaacagg 

ccgtctctct 


gtcaaggcca 

agccgctacg 

cggtgctatc 

gcagccctga 

cagagagaca 

cccagcttca 


gagcggcaag 

tatcgggggg 

atgccctcaa 

gtgtggttca 

tgcgggtgaa 

agttgttgct 


cagcaacgct 

tttcacttgc 

gaaccgatat 

tatcgggggc 

cgcacatccg 

ctgcgggcaa 


tcgataatgc 

aagtgatcac 

gaagattttc 

ccaagt cgcg 

ccagcttcga 

cgagtccgag 


gatctcgccg 

aagacgtaag 

gcgcacaagt 

ccgaat catc 

atccacgatc 

gcgccggaat 


gatcgcaacc 

atgaccgggg 

cagattcttc 

ctgctcaacg 

at ct gage at 

ctgttagttc 


tgccccaaga 

ccagctcggg 

cacgcaatcc 

acgggaccac 

gcgattgccc 

gacgtcccat 


cgaaccaaac 

tcaaattctc 

gccacagttc 

caacgcttgt 

ggatccccgc 

ccactgcatc 


aacagcgatt 

tcccagggtg 

cacggttgcc 

atgtcgacca 

etttttcegt 

cgccactacc 

4320 

aacctccata 

ccaacgccag 

atgcaatttt 

cgtcagatac 

geagegagaa 

cttgatcagc 

4380 

ttcaccgccg 

atctttcgta 

catcgagacc 

acccgaatta 

cgtagtggcg 

cagcaaatcc 

4440 

cagagatacg 

agtttggaag 

tccaccgatc 

gaacatcgca 

17 

tccgagaagg 

attcgaggat 

4500 


H 

m 

ru 


gtt ctcact c 

acgtcaccac 

tgaacatgag 

tagcgcgtga 

acgtggacgt 

gccagccgtt 


ttttccgtga 

gtgatttcaa 

cagcgcgcac 

gtatccgtcg 

cagccgtaca 

tttcacgttc 


cgtacgccaa 

cgacgaccgt 

tggtcgcagc 

tttccaggct 

gccgaaagt c 

cagt ccatag 


gtcgtggagc 

cgctgaccag 

ctgtatggcg 

catggtcatc 

gt caeca teg 

caacagatcc 


agtcccgagt 

tgatgagcaa 

caacttgaga 

aatttcgtct 

gcacgatgtg 

caccgacttt 


tcccgcacag 

caggggcaga 

tccagccctt 

tccgcaggaa 

cgaaggcctc 

cgaatccaga 


acctttcgga 

ccgttgacaa 

tggtcacacc 

gttcacgcca 

gaaateggee 

ggccacaggc 


gttaaatgtt 

tctgacgtgg 

tgatttgttg 

aagtttgggt 

cgcagttcgt 

gccggatgcc 


gcgcttatcg 

gacgacacga 

ggacgggagg 

ccggtctttg 

ccggaaaggt 

gttcagcact 


tacgctggtc 

ataacgagcg 

gggtcctagt 

caagtaggag 

cctcgaaggc 

ggcggcaggg 


tggtccaaca 

cccttcgtcg 

ccgctcgtat 

tttcggagta 

aatccagcta 

gttcagctcg 


gatactccac 

ttcgaggttc 

atcgattatt 

tggtttttat 

ccacttaacc 

agcagaaaca 


gcgtttatcg 

ctgatctgct 

ggtcagtgcg 

gcgtgtcggg 

ggagtegcta 

gtccgcggcg 


agtccccatg 

cttcgagaac 

accgaccttc 

tcttctgggg 

ttctgcttgt 

cttcaccagt 


gcatcgaaca 

gacctcggta 

ttcacccaag 

tgttcaatat 

cgaatccggc 

ttccctggcg 


taatcagggg 

tgtagtagca 

gcacatcgca 

gccagaatct 

eggacgatte 

ggcgcgttca 


ccagcatgaa 

tccaaccata 

aacgtcatgc 

ccaccccata 

gatcaggccc 

tcgatgatcg 


taaatgccaa 

cggctagtcg 

gaggatgaat 

accgtagctt 

cgtgcttcac 

gcatcaaccc 


tctgatctgc 

tgcactcaga 

attgcatgac 

ctcccgaatg 

actgeataac 

tegtegtaga 


cctgagcaac 

gaacgaaggc 

cgatcagcat 

tgtccatgaa 

gagttggacg 

aacttcggcc 


ggacgaggcc 

aatccacggc 

gcagtcaaag 

tttcaaaatc 

atgtgcctcg 

aggtgetcat 


gcattgcaac 

cgcccatgcg 

gcccctcgag 

cggcgcacca 

gtctcgttca 

actccct cgc 


tgtccgaaat 

gtcgtattta 

aggcccagtg 

atcgtccaac 

tteggcaget 

gcgtcactgg 


cacgtttcca 

atcgtcaccg 

cgtaagtcgt 

tgagctttcc 

gagttcatcg 

cctagaagca 

5940 

gctcagacat 

tgcaaaaacg 

gtcatcgaac 

tgacccatcg 

tggaccgact 

agtgcaccaa 


ggtcgtcgtc 

ggtgatctgc 

atgccgcgaa 

gttcgtcgac 

gacagcttgg 

ccttccaaac 


ctactctggc 

cctgagtatt 

tcagttatta 

cgagatgatc 

gttcggccag 

cctgattt ga 


tccggagtgc 

agtcgttacg 

actcgttccg 

tgggcaggtt 

teggegtgag 

gcgagttttt 


ctcctgcctc 

atgtgcaacc 

ttctcaaatt 

gctgtcgaat 

gtaggtgttt 

acegggattg 

6240 

cgtctgtcgg 

gtagccgatc 

aaggtgtgtc 

ctcctgtgtg 

ttcggttgtc 

agcctatgtc 

6300 

gccgagatgt 

gggtattccg 

aggcgattgt 

cattcgttgt 

geteggtega 

tgttcagtgc 

6360 

ggcttcgtac 

gcgaggtctg 

cgttttcgac 

gtaactcgcg 

18 

taggtcgagt 

actcacctac 

6420 


0 
H 

m 

fU 
M 


gcgaaccgtt 

cggatcattt 

ggcgccgtag 

tccgggttgg 

fccgaagaagc 

cttgaccagg 

6480 

gacgaatcct 

tcgaccagtg 

cgcactcttc 

aggtgttgcg 

ttggggtgca 

gcattttgac 

6540 

agattcaggt 

tcgtctacgc 

ggagagt cat 

tttcaccgcg 

aagttcgagc 

gggcgtttcc 


acccacgatg 

gaagcttccg 

cacgttgagc 

caagagcaac 

attctgatgc 

ctgctttggc 

6660 

agactgagca 

gcgatctgtc 

gaacaagcga 

tgtgatgcgg 

ggtgcgtatc 

tgtctgctgg 

6720 

tttcagaccg 

ttggttgcat 

cgaaatcctg 

tgcgccctcg 

atgattccgg 

gaaattcttc 

6780 

cagtacgagc 

aggatgagag 

gtagtgctgg 

cgagaacaac 

gaaattttgt 

ctatgcgtcg 

6840 

atcccagaaa 

cactcgattc 

gtcggtcaga 

ttctgctttg 

acgaactgga 

gcactcggag 

6900 

gactttgtca 

aaatcgttca 

gcccgagctc 

gatgttcggt 

tcagccggtc 

gtcggtggac 

6960 

gaatggggct 

agtaagacgg 

aagtgggatc 

gactccgacg 

acacgcacag 

cgggattcga 

7 020 

tccagcctgt 

gcgagcaggt 

tgtaggtgca 

ttgagatttt 

ccggaacggg 

ttttgccttg 

7080 

aatgagccag 

tgagcagcat 

ctttggctat 

atcgaccatg 

acaggttgag 

ctagttcgtt 

7140 

ccatccaatc 

gcaatcggga 

tcgtgtctgt 

ttgatccatc 

aggcgtccgt 

gcttttgtcg 

7200 

aacggaagat 

ccttttcttg 

ctcccaccag 

ggccgattgt 

ccccgagtat 

gccgccggcc 

7260 

tcttccttca 

atgtgccggc 

cgatgagtcc 

tcgacgtcac 

tgagccatgc 

tgcatctcgt 

7320 

gcttgagaaa 

tggtgtctgc 

atcgatcaga 

agtagctcga 

cccgacgcgg 

ctctactttg 

7380 

gtgaaactgg 

cacgtagagc 

accgaaagca 

tcggctattt 

tgaccgtctt 

cgatgtcata 

7440 

tcttcaccgg 

tgatccctgt 

cggaaggtcg 

aaagcgactg 

atcgagtcaa 

tccgtcgtcc 

7500 

gaaaatttgt 

agctacgaat 

gatgggaggc 

tgcccagagg 

agttgatcag 

accaagattg 

7560 

gccgcagcac 

ctgcaacttc 

cggggttcct 

cgccaccatc 

gagctgtacg 

acgtttgcga 

7620 

cgccgagcct 

tcgttgcctc 

tctcaggtag 

accattgcca 

caacgcacac 

cagcagcaca 

7680 

ctgaccaaaa 

gccacatctg 

agcgtcgaag 

atgtacagca 

gcagaagcaa 

cagaaacgta 

7740 

gaggacagaa 

tcgggtaatc 

ggcaattttt 

gccttgagtt 

ttgctcgcaa 

aatttgccag 

7800 

gtggaacgtc 

ttttaacctg 

gtcaccgcgt 

cgaacggctt 

cgtagttgct 

catcggggcc 

7860 

actccacaac 

gacattcgga 

ctatctactt 

cgacttgctc 

atctacgttc 

cacaaccacg 

7920 

attcgactgg 

aacgagagcg 

catcccgagg 

ttccattctg 

aagattgctt 

tgcactcgat 

7980 

cactcatcaa 

agtctctgga 

accgtctcag 

cctctacgcc 

cttatgtacc 

gggacagggg 

8040 

tattcacggt 

caaatacact 

gcccgccagc 

cctcaggcac 

tggcacgtca 

ccgcacgcgc 

8100 

tggtcttcga 

gtacggcgac 

gtgatgacct 

ttccatctgg 

gttagtccac 

tggatcccat 

8160 

cggcgctcaa 

ttccggattc 

actcggatgt 

atccaggtat 

ctctctgcat 

gcactgacag 

8220 

atggaacaga 

acctgtcgga 

agaggggatc 

tgcaccaggt 

caccgttcgt 

tcagcccatg 

8280 

agtcccgacg 

ctcttgcatt 

ccgctggaaa 

gcttaatatc 
19 

ttgcgtgcca 

acaatctgga 

8340 


8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 


tattacggcc tttttaaaga ccgtaaagaa aaataagcac aagttttatc cggcctttat 
tcacattctt gcccgcctga tgaatgctca tccggaattc cgtatggcaa tgaaagacgg 
tgagctggtg atatgggata gtgttcaccc ttgttacacc gttttccatg agcaaactga 
aacgttttca tcgctctgga gtgaatacca cgacgatttc cggcagtttc tacacatata 
ttcgcaagat gtggcgtgtt acggtgaaaa cctggcctat ttccctaaag ggtttattga 
gaatatgttt ttcgtctcag ccaatccctg ggtgagtttc accagttttg atttaaacgt 
ggccaatatg gacaacttct tcgcccccgt tttcaccatg ggcaaatatt atacgcaagg 
cgacaaggtg ctgatgccgc tggcgattca ggttcatcat gccgtttgtg atggcttcca 
tgtcggcaga atgcttaatg aattacaaca gtactgcgat gagtggcagg gcggggcgta 8880 
atttttttaa ggcagttatt ggtgccctta aacgcctggt gctacgcctg aataagtgat 8 940 
aataagcgga tgaatggcag aaattcgaaa gcaaattcga cccggtcgtc ggttcagggc 
agggtcgtta aatagccgct tatgtctatt gctggtttac cggtttattg actaccggaa 
gcagtgtgac cgtgtgcttc tcaaatgcct gaggccagtt tgctcaggct ctccccgtgg 
aggtaataat tgacgatatg atcatttatt ctgcctccca gagcctgata aaaacggtga 
atccgttagc gaggtgccgc cggcttccat tcaggtcgag gtggcccggc tccatgcacc 9240 
gcgacgcaac gcggggaggc agacaaggta tagggcggcg cctacaatcc atgccaaccc 9300 
gttccatgtg ctcgccgagg cggcataaat cgccgtgacg atcagcggtc cagtgatcga 
agttaggctg gtaagagccg cgagcgatcc ttgaagctgt ccctgatggt cgtcatctac 
ctgcctggac agcatggcct gcaacgcggg catcccgatg ccgccggaag cgagaagaat 
cataatgggg aaggccatcc agcctcgcgt cgcgaacgcc agcaagacgt agcccagcgc 
gtcggccgcc atgccggcga taatggcctg cttctcgccg aaacgtttgg tggcgggacc 
agtgacgaag gcttgagcga gggcgtgcaa gattccgaat accgcaagcg ac 9652 

<210> 8 

<211> 29 

<212> DNA 

<213> Rhodococcus AN12 


gtgcgaaaac tggacagctg gctacacta 

<210> 9 
<211> 19 


9000 
9060 
9120 
9180 


9360 
9420 
9480 
9540 
9600 


20 


<212> DNA 
<213> Primer 


<400> 9 

gagtttgatc ctggctcag 

<210> 10 

<211> 16 

<212> DNA 

<213> Primer 

<400> 10 
taccttgtta cgactt 

<210> 11 

<211> 17 

<212> DNA 

<213> Primer 


<400> 11 

gtgccagcag ymgcggt 


<210> 12 

<211> 1424 

<212> DNA 

<213> Rhodococcus AN12 


<400> 12 

tcgagcggta gagagaagct tgcttctctt gagagcggcg gacgggtgag taatgcctag 60 
gaatctgcct ggtagtgggg gataacgttc ggaaacggac gctaataccg catacgtcct 120 
acgggagaaa gcaggggacc ttcgggcctt gcgctatcag atgagcctag gtcggattag 180 
ctagttggtg aggtaatggc tcaccaaggc gacgatccgt aactggtctg agaggatgat 240 
cagtcacact ggaactgaga cacggtccag actcctacgg gaggcagcag tggggaatat 
tggacaatgg gcgaaagcct gatccagcca tgccgcgtgt gtgaagaagg tcttcggatt 
gtaaagcact ttaagttggg aggaagggca gttacctaat acgtgattgt tttgacgtta 
ccgacagaat aagcaccggc taactctgtg ccagcagccg cggtaataca gagggtgcaa 

21 


300 
360 
420 
480 


gcgttaatcg gaattactgg gcgtaaagcg cgcgtaggtg gtttgttaag ttggatgtga 540 
aatccccggg ctcaacctgg gaactgcatt caaaactgac tgactagagt atggtagagg 
gtggtggaat ttcctgtgta gcggtgaaat gcgtagatat aggaaggaac accagtggcg 
aaggcgacca cctggactga tactgacact gaggtgcgaa agcgtgggga gcaaacagga 
ttagataccc tggtagtcca cgccgtaaac gatgtcaact agccgttggg agccttgagc 780 
tcttagtggc gcagctaacg cattaagttg accgcctggg gagtacggcc gcaaggttaa 
aactcaaatg aattgacggg ggcccgcaca agcggtggag catgtggttt aattcgaagc 
aacgcgaaga accttaccag gccttgacat ccaatgaact ttctagagat agattggtgc 
cttcgggaac attgagacag gtgctgcatg gctgtcgtca gctcgtgtcg tgagatgttg 
ggttaagtcc cgtaacgagc gcaacccttg tccttagtta ccagcacgta atggtgggca 
ctctaaggag actgccggtg acaaaccgga ggaaggtggg gatgacgtca agtcatcatg . 1140 
gcccttacgg cctgggctac acacgtgcta caatggtcgg tacagagggt tgccaagccg 
cgaggtggag ctaatcccag aaaaccgatc gtagtccgga tcgcagtctg caactcgact 
gcgtgaagtc ggaatcgcta gtaatcgcga atcagaatgt cgcggtgaat acgttcccgg 1320 
gccttgtaca caccgcccgt cacaccatgg gagtgggttg caccagaagt agctagtcta 1380 
accctcggga ggacggttac cacggtgtga ttcatgactg gggt 

<210> 13 

<211> 17 

<212> DNA 

<213> Primer 

<400> 13 

gtaaaacgac ggccagt 

<210> 14 

<211> 24 

<212> DNA 

<213> Primer 


<400> 14 

agcggataac aatttcacac agga 


600 
660 
720 


840 
900 
960 
1020 
1080 


1200 
1260 


1424 


22 


<210> 15 

<211> 722 

<212> DNA 

<213> Rhodococcus AN12 


<400> 15 
aagctttcca 

gcggaatgca 

agagcgtcgg 

gactcatggg 

ctgaacgaac 

ggtgacctgg 

60 

tgcagatccc 

ctcttccgac 

aggttctgtt 

ccatctgtca 

gtgcatgcag 

agagatacct 

120 

ggatacatcc 

gagtgaatcc 

ggaattgagc 

gccgatggga 

tccagtggac 

taacccagat 

180 

ggaaaggtca 

tcacgtcgcc 

gtactcgaag 

accagcgcgt 

gcggtgacgt 

gccagtgcct 

240 

gagggctggc 

gggcagtgta 

tttgaccgtg 

aatacccctg 

tcccggtaca 

taagggcgta 

300 

gaggctgaga 

cggttccaga 

gactttgatg 

agtgatcgag 

tgcaaagcaa 

tcttcagaat 

360 

ggaacctcgg 

gatgcgctct 

cgttccagtc 

gaatcgtggt 

tgtggaacgt 

agatgagcaa 

420 

gtcgaagtag 

atagtccgaa 

tgtcgttgtg 

gagtggcccc 

gatgagcaac 

tacgaagccg 

480 

ttcgacgcgg 

tgaccaggtt 

aaaagacgtt 

ccacctggca 

aattttgcga 

gcaaaactca 

540 

aggcaaaaat 

tgccgattac 

ccgattctgt 

cctctacgtt 

tctgttgctt 

ctgctgctgt 

600 

acatcttcga 

cgctcagatg 

tggcttttgg 

tcagtgtgct 

gctggtgtgc 

gttgtggcaa 

660 

tggtctacct 

gg 

gagagaggca 

acgaaggctc 

ggcgtcgcaa 

acgtcgtaca 

gctcgatggt 

720 
722 


Ul <210> 16 

<211> 523 
<212> DNA 


<213> Rhodococcus AN12 


<400> 16 

aagcttccgc acgttgagcc aagagcaaca ttctgatgcc tgctttggca gactgagcag 60 

cgatctgtcg aacaagcgat gtgatgcggg gtgcgtatct gtctgctggt ttcagaccgt 120 

tggttgcatc gaaatcctgt gcgccctcga tgattccggg aaattcttcc agtacgagca 180 

ggatgagagg tagtgctggc gagaacaacg aaattttgtc tatgcgtcga tcccagaaac 240 

actcgattcg tcggtcagat tctgctttga cgaactggag cactcggagg actttgtcaa 300 

aatcgttcag cccgagctcg atgttcggtt casccggtcg tcggtggacg aatggggcta 360 

gtaagacgga agtgggatcg actccgacga cacgcacagc gggattcgat ccagcctgtg 420 


23 


nj 
m 

a 


cgagcaggtt gtaggtgcat tgagattttc cggaacgggt tttgccttga atgagccagt 
gagcagcatc tttggctata tcgaccatga caggttgagc tag 

<210> 17 

<211> 606 

<212> DNA 

<213> Rhodococcus AN12 


<400> 17 
aagcttccat 

cgtgggtgga 

aacgcccgct 

cgaacttcgc 

ggtgaaaatg 

actctccgcg 

60 

tagacgaacc 

tgaatctgtc 

aaaatgctgc 

accccaacgc 

aacacctgaa 

gagtgcgcac 

120 

tggtcgaagg 

attcgtccct 

ggtcaaggct 

tcttcgacca 

acccggacta 

cggcgccaaa 

180 

tgatccgaac 

ggttcgcgta 

ggtgagtact 

cgacctacgc 

gagttacgtc 

gaaaacgcag 

240 

acctcgcgta 

cgaagccgca 

ctgaacatcg 

accgagcaca 

acgaatgaca 

atcgcctcgg 

300 

aatacccaca 

tctcggcgac 

ataggctgac 

aaccgaacac 

acaggaggac 

acaccttgat 

360 

cggctacccg 

acagacgcaa 

tcccggtaaa 

cacctacatt 

cgacagcaat 

ttgagaaggt 

420 

tgcacatgag 

gcaggagaaa 

aactcgcctc 

acgccgaaac 

ctgcccacgg 

aacgagtcgt 

480 

aacgactgca 

ctccggatca 

aatcaggctg 

gccgaacgat 

catctcgtaa 

taactgaaat 

540 

actcagggcc 

agagtaggtt 

tggaaggcca 

agctgtcgtc 

gacgaacttc 

gcggcatgca 

600 

gatcac 






606 


<210> 18 

<211> 30 

<212> DNA 

<213> Primer 


<400> 18 

actttattgt catagtttag atctattttg 


<210> 19 

<211> 20 

<212> DNA 

<213> Primer 


24 


<400> 19 

acttgcgaac cgatattatc 


<210> 

20 

<211> 

20 

<212> 

DNA 

<213> 

Primer 


<400> 20 

ttatgaccag cgtaagtgct 

<210> 21 
<211> 459 
<212> PRT 

<213> Arcanobacterium pyogenes 
<400> 21 

Met Asn Arg Leu Ser Glu Arg Thr Ala Leu Ser Leu Pro Ala Arg Gin 
15 10 15 

lie Gin Lys Val lie Pro Ala Ala Gly Gly Arg Ser Leu Lys Ser Phe 
20 25 30 

Glu Gly Met Thr Ala Thr Trp Ser Ala Arg Gly Gly Ala Ser Ser Asp 
35 40 45 

Glu Arg Ser Arg Asp Lys Arg Ser Gin lie Pro Ser Asn Arg Arg Glu 
50 55 60 

Gly Arg Ser Ala Thr His Pro Leu Gly Asn Thr Val Leu Thr Phe Pro 
65 70 75 80 

Val Ser Asn Glu Ser Lys Lys Thr Ala Lys Ser Arg Arg Ser Glu Arg 
85 " 90 95 

Tyr Glu Leu Arg Asp Gly Leu Ala Glu lie Ser Thr lie Glu Ser Val 
100 105 110 

Arg Lys Cys Gly Arg Val Pro Val Ala Pro Leu Val Ser Leu Arg Ala 
115 120 125 

Lys Ser Asp Gly Lys Gly Ala Gly Tyr Gly Gly Leu His Thr Cys Gly 
130 135 140 
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Ser Val Trp Ala Cys Pro Val Cys Ser Ala Lys lie Ala Ala Arg Arg 
145 ' 150 155 160 


Lys Thr Asp Leu Gin Gin Val Val Asp His Ala Val Lys His Gly Met 
165 170 175 

Thr Val Ser Met Leu Thr Leu Thr Gin Arg His His Lys Gly Gin Gly 
180 185 190 

Leu Lys His Leu Trp Asp Ala Leu Ser Thr Ala Trp Asn Arg Val Thr 
195 200 205 

Ser Gly Arg Arg Trp lie Glu Phe Lys Glu Gin Phe Gly Leu Val Gly 
210 215 220 

Tyr Val Arg Ala Asn Glu He Thr His Gly Lys His Gly Trp His Val 
225 230 235 240 

§*& 

0 His Ser His Val Leu He He Ser Glu Lys Asp Pro Leu Thr Ser Thr 

O 245 250 255 

3 

ill Phe Val Tyr Gin Arg Lys Gin Gly Arg Arg Arg Leu Pro Tyr Pro Pro 

W 260 265 270 

H 

%$. Glu He Tyr Met Ser Ser Asp Phe He Ala Glu Arg Trp Glu Ala Gly 

m 275 280 285 

81 Leu Ala Lys His Gly Val Asp Phe Leu Arg Asp Ser Gly Gly Leu Asp 

0 290 295 300 

u 

Trp Thr Val Ala Lys Asp Ala Arg Ala He Gly Asn Tyr Val Ser Lys 
305 310 315 320 

Met Gin Thr Ser Thr Asp Ala He Ser Ser Glu Val Thr Leu Gly Gly 
325 330 335 

Phe Lys Lys Ala Arg Asn Gly Asn Arg Thr Pro Phe Gin lie Leu Ala 
340 345 350 

Asp He Leu Ser Leu Gly Asp Val Asp Asp Leu Lys Leu Trp Lys Glu 
355 360 365 

Tvr Glu Lys Ala Ser Phe Gly Arg Arg Ala Leu Thr Trp Ser Lys Gly 
370 375 " 380 

Leu Arg Asp Trp Ala Asn Leu Gly Val Glu Gin Ser Asp Glu Glu He 
385 390 395 400 
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Ala Ser Glu Glu lie Gly Asp Glu Ala lie Ala Leu Phe Thr His Asp 
405 410 415 


SI 

in 


Ala Trp Arg Gin Val Arg Arg Phe Gly Ala Ala Glu Leu Leu Asp Val 

420 425 430 

Thr Glu Ser Gly Gly Arg Ala Ala Ala Tyr Arg Trp Leu Asp Phe Arg 
435 440 445 

Glu lie Asp Trp Ser Leu Pro Pro Lys lie Glu 
450 ~ 455 

<210> 22 
<211> 456 
<212> PRT 

<213> Streptomyces lividans 
<400> 22 

Met Asp Pro Ala Ser Gly Val lie Val Ala Gin Thr Ala Ala Gly Thr 
W 15 10 15 

M 

|& Ser Val Val Leu Gly Leu Met Arg Cys Gly Arg He Trp Leu Cys Pro 


Val Cys Ala Ala Thr He Arg His Lys Arg Ala Glu Glu He Thr Ala 
35 40 45 


Ala Val Val Glu Trp lie Lys Arg Gly Gly Thr Ala Tyr Leu Val Thr 


Phe Thr Ala Arg His Gly His Thr Asp Arg Leu Ala Asp Leu Met Asp 
65 70 75 80 

Ala Leu Gin Gly Thr Arg Lys Thr Pro Asp Ser Pro Arg Arg Pro Gly 


Ala Tyr Gin Arg Leu lie Thr Gly Gly Thr Trp Ala Gly Arg Arg Ala 

100 105 110 

Lys Asp Gly His Arg Ala Ala Asp Arg Glu Gly He Arg Asp Arg He 

115 120 125 

Gly Tyr Val Gly Met lie Arg Ala Thr Glu Val Thr Val Gly Gin lie 
130 135 140 
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Asn Gly Trp His Pro His lie His Ala He Val Leu Val Gly Gly Arg 
145 150 155 160 

Thr Glu Gly Glu Arg Ser Ala Lys Gin He Val Ala Thr Phe Glu Pro 
165 170 175 

Thr Gly Ala Ala Leu Asp Glu Trp Gin Gly His Trp Arg Ser Val Trp 
180 185 190 

Thr Ala Ala Leu Arg Lys Val Asn Pro Ala Phe Thr Pro Asp Asp Arg 
195 200 205 

His Gly Val Asp Phe Lys Arg Leu Glu Thr Glu Arg Asp Ala Asn Asp 
210 215 220 

Leu Ala Glu Tyr He Ala Lys Thr Gin Asp Gly Lys Ala Pro Ala Leu 
225 ' 230 235 240 

u 

f'-'i Glu Leu Ala Arg Ala Asp Leu Lys Thr Ala Thr Gly Gly Asn Val Ala 

pj 245 250 255 

H 

sfi Pro Phe Glu Leu Leu Gly Arg He Gly Asp Leu Thr Gly Gly Met Thr 

!U 260 265 270 

l s Glu Asp Asp Ala Ala Gly Val Gly Ser Leu Glu Trp Asn Leu Ser Arg 

rr; 275 280 285 

HI Trp His Glu Tyr Glu Arg Ala Thr Arg Gly Arg Arg Ala He Glu Trp 

Q 290 295 300 

Thr Arg Tyr Leu Arg Gin Met Leu Gly Leu Asp Gly Gly Asp Thr Glu 
305 310 315 320 

Ala Asp Asp Leu Asp Leu Leu Leu Ala Ala Asp Ala Asp Gly Gly Glu 
325 330 335 

Leu Arg Ala Gly Val Ala Val Thr Glu Asp Gly Trp His Ala Val Thr 
340 345 350 

Arg Arg Ala Leu Asp Leu Glu Ala Thr Arg Ala Ala Glu Gly Lys Asp 
355 360 365 

Gly Asn Glu Asp Pro Ala Ala Val Gly Glu Arg Val Arg Glu Val Leu 
370 375 380 

Ala Leu Ala Asp Ala Ala Asp Thr Val Val Val Leu Thr Ala Gly Glu 
385 390 395 400 
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Val Ala Glu Ala Tyr Ala Asp Met Leu Ala Ala Leu Ala Gin Arg Arg 
405 410 415 


Glu Glu Ala Thr Ala Arg Arg Arg Arg Glu Gin Asp Asp Asp Gin Asp 
420 425 430 


i Asp Ala Asp Asp Arg Gin Glu Arg Ala Ala Arg His lie Ala Arg 
435 440 445 


Leu Ala Ser Gly Pro Thr Ser His 
450 455 


<210> 23 

<211> 528 

<212> PRT 

<213> Streptomyces phaeochromogenes 


<400> 23 

Met Leu Asn Arg Val Ser Gly lie Asp Ala Cys Gly Gly Cys Gly Arg 


Arg Val Leu Asp Pro Asp Thr Gly Val lie Tyr Ala Lys Ser Ser Arg 
20 25 30 

Gly Tyr Val Val Thr lie Gly Leu Val Arg Cys Gly Arg lie Trp Phe 
35 40 45 

Cys Pro Glu Cys Ser Ser Ala lie Arg Arg Gly Arg Thr Glu Glu He 
50 55 60 

Lys Thr Gly Ala Leu Arg His Leu Ala Ala Gly Gly Thr Leu Ala Val 
65 " 70 75 80 


Val Val Leu Thr Ala Arg His Asn Gin Thr Thr Asp Leu Asp Ser Leu 


Val Ala Ala Leu Trp Gly Gly Pro Leu Leu Asp Asp Lys Gly Ala Pro 

100 105 HO 

Val Leu Asp Arg Ser Gly Lys Pro Arg Arg Ala Pro Gly Ala Tyr Gin 

115 120 125 

Arg Met Leu Thr Ala Pro Ala Phe Tyr Gly Arg Pro Glu Ala Arg Arg 
130 135 140 
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SI 


Thr Arg Lys Asp Gly Thr Gin Tyr Val Arg Pro Ala Glu Asp Gly lie 
145 150 155 160 

Arg His Arg lie Gly Tyr lie Gly Met Val Arg Ala Ala Glu Val Thr 
165 170 175 


Arg Ser Lys Lys Asn Gly Tyr His Pro His Leu Asn Leu Leu Val Phe 

180 185 190 


Leu Gly Gly Glu Leu Ser Gly Thr Pro Ala Lys Gly Asp Val Val Gly 
195 200 205 


His Phe Glu Pro Ser Glu Thr Asp Leu Gly Asp Trp Glu Asp Trp Leu 
210 215 220 


Arg Glu Met Trp Ala Gly Ala Leu Lys Arg Ala Asp Pro Lys Phe Glu 
225 230 235 240 

Pro Ser Thr Asp Cys Asp Thr Pro Gly Cys Lys Cys Lys Gly Lys Gly 
245 250 255 


His Gly Val Met Val Ser lie Val Arg Ser Ala Asp Asp Val Ala Leu 
260 265 270 


lie Glu Tyr Leu Thr Lys Asn Gin Asp Gly Lys Arg Glu Arg Pro Asp 
275 280 285 

Ser Val Asp Gin Asp Leu Glu Ala Ala Gly Ala Ala Ala Met Glu Thr 
290 295 300 


Ala Arg Leu Asp Ser Lys Thr Gly Arg Gly Arg Lys Ser Met Thr Pro 
305 310 " 315 320 

Phe Gin lie Leu Tyr Arg Leu Trp Asp lie Glu Val Ala Gly Leu Asp 

325 330 335 


Pro Asp Met Ala Glu Gly Tyr Gly Thr Pro Lys Gin Leu Arg Ala Trp 
340 345 350 

Trp Ala Gin Tyr Glu Glu Ala Leu Ala Gly Arg Arg Ala lie Glu Trp 

355 360 365 

Thr Arg Gly Leu Arg Arg His Val Asp Leu Asp Gly Asp Asp Asp Glu 
370 375 380 

Glu Thr Asp Leu Gin Tyr Val Tyr Glu Pro Glu Ala Ala Pro Leu Asp 

385 390 395 400 
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Gly Gly Val Val Leu Thr Ser Asp Ala Met Arg Leu Val Val Gly Ala 
405 410 415 

Asp Ala Glu Leu Asp Leu Asp Asp Val Val Arg Ala Glu Ala Tyr Tyr 
420 425 430 

Ser Ala Val Asp Val Val Thr Gly Leu Gly Gly Arg Ala Asp His Val 
435 440 445 

Arg Val Ala Thr Ala Glu Glu Leu Ala Glu Val Gin Glu Val Leu Phe 
450 455 460 

Ala Arg Thr Gin Glu Arg Ala Glu Glu Ser Arg Arg Gin Arg Arg lie 
465 ' 470 475 480 

Ala Glu His Glu Ala Glu Gin Ala Ala Ala His Arg Lys Arg Gin Glu 
485 490 495 

M> 

13 

Ti Leu Ala Arg Cys Leu Gly Leu Leu Val Arg Gin Arg Gly Gly Thr Gin 

f % 500 505 510 

X S\ Asp Asp Ser Ala Ala Asp Asn Phe Val Ala His lie His Ala Asn Arg 

fU 515 520 525 

NJ 

U <210> 24 

iy <211> 451 

U 

111 <212> PRT 

<213> Streptomyces nigirif aciens 


Met Asp Pro Ala Ser Gly Val lie Val Ala Gin Thr Ala Ala Gly Thr 
15 10 15 


Ser Val Val Leu Gly Leu Met Arg Cys Gly Arg lie Trp Leu Cys Pro 
20 25 30 

Val Cys Ala Ala Thr lie Arg His Lys Arg Ala Glu Glu lie Thr Ala 
35 40 45 

Ala Val Val Glu Trp lie Lys Arg Gly Gly Thr Ala Tyr Leu Val Thr 
50 55 60 


Phe Thr Ala Arg His Gly His Thr Asp Arg Leu Ala Asp Leu Met Asp 
65 70 75 80 
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Ala Leu Gin Gly Thr Arg Lys Thr Ala Asp Ala Pro Arg Arg Pro Gly 
85 90 95 


Ala Tyr Gin Arg Leu lie Thr Gly Gly Thr Trp Ala Gly Arg Arg Ala 
100 105 110 

Lys Asp Gly His Arg Ala Ala Asp Arg Glu Gly lie Arg Asp Arg lie 
115 120 125 

Gly Tyr Val Gly Met lie Arg Ala Thr Glu Val Thr Val Gly Gin lie 
130 135 140 

Asn Gly Trp His Pro His lie His Ala lie Val Leu Val Gly Gly Arg 
145 ~ 150 155 160 

Thr Glu Gly Glu Arg Ser Ala Lys Gin lie Val Gly Thr Phe Glu Pro 
165 170 175 

o 

fl Ser Glu Ala Ala Leu Asp Glu Trp Gin Gly Gin Trp Arg Ala Val Trp 

Q 180 185 190 

Thr Ala Ala Leu Arg Lys Val Asn Pro Gin Phe Thr Pro Asp Asp Arg 
flf 195 200 205 

U- His Gly Val Asp Phe Lys Arg Leu Glu Thr Glu Arg Asp Ala Asn Asp 

?■< 210 215 220 

P; 

Hi Leu Ala Glu Tyr lie Ala Lys Thr Gin Asp Gly Lys Ala Pro Ala Leu 

Ff 225 " 230 235 240 

I* 

Glu Leu Ala Arg Ala Asp Leu Lys Thr Ala Asn Gly Gly Asn Val Ala 
245 250 255 

Pro Phe Glu Leu Leu Gly Arg lie Gly Asp Leu Thr Gly Gly Met Thr 
260 265 270 

Glu Asp Asp Ala Ala Gly Val Gly Ser Leu Glu Trp Asn Leu Ala Arg 

275 280 285 

Trp His Glu Tyr Glu Arg Ala Thr Lys Gly Arg Arg Ala lie Glu Trp 
290 295 300 

Thr Arg Tyr Leu Arg Gin Met Leu Gly Leu Asp Gly Gly Asp Thr Glu 
305 ' 310 315 320 

Ala Asp Asp Leu Asp Leu Leu Leu Ala Ala Asp Ala Asp Gly Gly Glu 
325 330 335 
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Leu Arg Ala Gly Val Ala Val Thr Glu Asp Gly Trp His Ala Val Thr 

340 345 350 


Arg Arg Ala Leu Asp Leu Ala Ala Thr Gin Ala Ala Glu Gly Thr Asp 
355 360 365 


Gly Asn Thr Asp Pro Ala Ala Met Gly Glu Arg Val Arg Glu Val Leu 
370 375 380 


Ala His Ala Asp Ala Ala Asp Ala Val Val Val Leu Thr Ser Gly Glu 

385 390 395 400 


Val Ala Glu Ala Tyr Ala Asp Met Leu Ala Ala Leu Ala Leu Arg Arg 
405 410 415 


Glu Glu Ala Ala Ala Arg Arg Arg Arg Glu Gin Asp Asp Asp Gin Asp 
420 425 430 


Asp Asp Ala Asp Asp Arg Gin Glu Arg Ala Ala Arg His lie Ala Arg 


Leu Arg Asn 
450 


<210> 25 

<211> 30 

<212> DNA 

<213> Streptomyces lividans 


<400> 25 

gaggcaaaag cgaacacctt gggaaagaaa 


<210> 26 

<211> 30 

<212> DNA 

<213> Streptomyces phaeochromogenes 


<400> 26 

ctggcaaaaa gggacgccta ggtaaaggtt 
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<210> 27 
<211> 31 
<212> DNA 

<213> Streptomyces nigirif aciens 
<400> 27 

gacccaaaac tgtcgcgcct tgggaaagaa a 

<210> 28 

<211> 20 

<212> DNA 

<213> Primer 

<400> 28 

atttcgttga acggctcgcc 

<210> 29 

<211> 20 

<212> DNA 

<213> Primer 

<400> 29 

cggcaatccg acctctacca 

<210> 30 

<211> 20 

<212> DNA 

<213> Primer 

<400> 30 

tgagacgagc cgtcagcctt 


